(54) Targeted delivery through a cationic amino acid transporter

(57) The invention relates to the targeted delivery of 
substances to cells. The invention provides a virus-like 
particle or gene delivery vehicle provided with a ligand 
capable of binding to a human amino acid transporter. 
Provided are for example ligands that can bind to the 
human transporter of cationic L-amino acids (hCAT1). 
Such hCAT1 binding molecules find applications in the 
design of vector systems for entry into human or pri- 
mate cells. Preferred are retroviral envelope molecules, 
which - when incorporated in a virus particle - can infect 
hCAT1 positive cells at high frequencies. Also within the 
scope of the invention are methods for the design of 
such hCAT1 binding molecules. 
Description

[0001] The invention relates to the targeted delivery of substances to cells. 

[0002] Delivery of substances to cells allows specific treatment of said cells with compounds that act in the targeted cell. For example, tumour cells, when targeted with toxic components, selectively die when said toxin is delivered to said cell. Yet other cells, when provided with a gene lacking in said cell, can be restored in their function, so-called gene therapy.

[0003] Delivery of a compound to a cell preferably occurs with a vehicle or particle that effectively brings the compound to the desired cell or cells and then delivers said compound into that cell (in vivo or in vitro) where it can exert its action. For this purpose, particles such as virus-like particles are suited. These particles, often derived from known viruses such as retrovirus or adenovirus, are small enough to penetrate in-between tissues and cells and arrive at a cell of choice, where they can for example fuse with said cell and deliver their compound. Said virus-like particles may or may not be infectious in themselves; their main purpose is the targeted delivery of the compound of interest, such as a gene, a toxin or immunostimulating components such as antigens.

[0004] Yet other examples are gene-delivery vehicles, specifically designed to transfer a gene to a cell of interest. Virus-like particles capable of delivering a gene are examples of said gene-delivery vehicles; however, other examples of such vehicles, of non-viral origin, such as liposomes or microbodies, or even latex particles, are also known. Vehicles such as liposomes or microbodies can of course also carry compounds other than a gene; in particular, toxic or immunostimulating components such as antigens can be included in such a vehicle.

[0005] These vehicles or particles all have in common that they need to be provided with a molecule or fragment thereof (ligand) capable of binding with said targeted cell, allowing targeting of said particles or vehicles to cells. There is a need for specific or broadly applicable ligands that react with cell-surface receptors on cells. In particular there is a need for ligands that react with cell-surface receptors after which efficient transfer of said compound, such as a gene, to said cell is possible. Especially in human medicine, such a ligand would enable better application of gene-transfer therapy than is possible now.

[0006] It has been a long-standing objective to exploit retrovirus technology in human gene therapy applications. However, the infection spectrum of retroviruses limits the use of these viruses in such applications. All known env variants have a rather broad infection spectrum in common. Here lies one of the major shortcomings of current recombinant retrovirus technology. For the purpose of gene therapy, retroviruses would be very useful vehicles for the transfer of therapeutic sequences if proper ligand-receptor targets were available. In conclusion, the concept of the use of retroviruses in human gene therapy is well documented (Gordon and Anderson, 1994; Havenga et al., 1997; Vile et al., 1996). However, it would be clearly advantageous and desirable to devise a strategy for targeted delivery of retroviruses, and modification of the infection spectrum.

[0007] The invention provides a virus-like particle or gene delivery vehicle provided with a ligand capable of binding to a human amino acid transporter. The invention provides said particle or vehicle wherein said ligand comprises peptide molecules or fragments thereof binding said transporter, for example hCAT1. The peptides or fragments thereof can bind to, for example, the third extracellular domain of the cationic amino acid transporter hCAT1 or can bind to cells expressing this domain of hCAT1 protein on their extracellular cell surface. These hCAT1 binding molecules can be peptides or antibody fragments displayed on a filamentous phage or as free molecules. In a preferred embodiment, the invention provides a virus-like particle or gene delivery vehicle for delivery of genes to human cells; however, it is also possible to provide said particles or vehicles with other compounds, such as toxins for selective killing or antigens for immunisation.

[0008] In a particular embodiment of the invention, a virus-like particle or gene delivery vehicle is provided comprising at least one viral protein provided with said ligand. Included in the present invention is the use of hCAT1 binding ligands to provide a particle or vehicle that employs hCAT1 to enter a hemopoietic stem cell or any other cell expressing hCAT1. hCAT1 ligands can be incorporated in the envelope of a retrovirus or the capsid of any other viral or non-viral gene transfer vehicle such as an adenoviral vector. Incorporation of these hCAT1 binding sequences can be done using techniques known in the art.

[0009] The invention provides a virus-like particle or gene delivery vehicle wherein said viral protein comprises an envelope protein. In a preferred embodiment the invention provides a mutant retroviral envelope that is derived from a wild-type ecotropic envelope and which employs hCAT1 to enter the human or primate cell by binding to hCAT1. Such a new retroviral envelope molecule, when incorporated in a retroviral virion, will be able to infect hCAT1 positive cells such as human PHSCs at high efficiencies. The mutant retroviral envelopes can be used to pseudotype recombinant type C retrovirus including but not limited to murine leukemia retroviral vectors. In a further embodiment of the present invention these hCAT1 binding envelopes can also be used to pseudotype lentiviral vectors including equine or HIV derived lentiviral vectors (Kim et al., 1998; Rizvi and Panganiban, 1992), (Kafri et al., 1997; Poeschla et al., 1996), (Miyoshi et al., 1997; Naldini et al., 1996b). Any hCAT1 ligands or binding envelope molecules or parts thereof made according to the methods described herein or other methods can be ligated into full length mammalian retroviral envelope expression constructs and introduced in cell lines expressing and containing all the sequences necessary for the generation of infectious and functional retroviral particles. In a preferred embodiment the invention provides a virus-like particle or gene delivery vehicle derived from a retrovirus.

[0010] In yet another embodiment, the invention provides a virus-like particle or gene delivery vehicle wherein said viral protein comprises a capsid protein. hCAT1 binding sequences or ligands can also be incorporated in the capsid proteins of adenovirus, including but not limited to the HI loop of the knob domain of an adenovirus (Krasnykh et al., 1998), preferably an adenovirus which does not bind to the adenoviral receptor CAR1 or MHC1. This results in an adenovirus that enters cells through hCAT1. As deduced from the absence of mCAT1 expression in mouse liver (Closs et al., 1993), an hCAT1 binding adenovirus does not exhibit liver transduction when administered in vivo. By combining an hCAT1 targeted knob with a ligand for another in vivo target, hCAT1 targeting of an adenovirus can remove an important limitation of in vivo use of adenoviral vectors for gene therapy (Sullivan et al., 1997). In another embodiment an hCAT1 targeted adenovirus will more efficiently transduce cells that are difficult to transduce, such as endothelial cells or smooth muscle cells, as compared to a wildtype adenoviral vector including but not limited to an adenoviral vector derived from the adenoviral serotype 5. The invention provides a virus-like particle or gene delivery vehicle derived from an adenovirus.

[0011] An hCAT1 targeted adenovirus is useful for local applications of adenoviral vector such as in patients with restenosis following balloon angioplasty where smooth muscle cells need to be transduced with for example an adenoviral vector carrying the ceNOS cDNA. More efficient transduction of these tissues allows lower multiplicities of infection (MOIs) to be used and therefore results in less vector associated toxicity to the tissues surrounding the target cells (PCT/EP98/00723).

[0012] In a preferred embodiment, the invention provides a virus-like particle or gene delivery vehicle according to the invention wherein said amino acid transporter is a cationic amino acid transporter, preferably a human cationic amino acid transporter-1 (hCAT1). In a preferred example of the invention provided in the experimental part, the invention provides a virus-like particle or gene delivery vehicle wherein said ligand comprises an amino acid sequence selected from Table 2, preferably from the last four different sequences of Table 2 or a sequence functionally related thereto. Various examples of a ligand having hCAT1 binding activity are provided; a particularly strong example is a ligand comprising at least a part of the amino acid sequence SVSVGMKPSPRP, said part comprising minimally 5, more preferably minimally 7 amino acids.
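
The fragment requirement above can be made concrete with a short sketch. It assumes that "a part of" the sequence means a contiguous stretch of residues, which the text does not state explicitly; the function name `contiguous_fragments` is illustrative only.

```python
LIGAND = "SVSVGMKPSPRP"  # 12-residue sequence named in the text


def contiguous_fragments(seq, min_len):
    """Return every contiguous sub-sequence of seq with at least min_len residues.

    Assumption: "a part of" the ligand is read as a contiguous stretch.
    """
    return [
        seq[start:start + length]
        for length in range(min_len, len(seq) + 1)
        for start in range(len(seq) - length + 1)
    ]


if __name__ == "__main__":
    for minimum in (5, 7):
        frags = contiguous_fragments(LIGAND, minimum)
        print(f"minimum {minimum} residues: {len(frags)} candidate fragments")
```

Under this reading, the 7-residue minimum narrows the candidate set considerably compared with the 5-residue minimum.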

[0013] In yet another embodiment, the invention provides a virus-like particle or gene delivery vehicle according to the invention wherein said ligand comprises a fragment derived from a phage displaying at least one antibody fragment selected for its capacity to bind with said amino acid transporter. In particular, a virus-like particle or gene delivery vehicle is provided wherein said antibody fragment comprises an amino acid sequence as shown in Figure 16 or an amino acid sequence functionally equivalent thereto or obtainable by a method as described in the experimental part of this description.

[0014] The invention also provides use of a virus-like particle or gene delivery vehicle according to the invention in gene-transfer therapy. In numerous gene therapy applications targeted delivery of genes into defined cells is provided by the invention, most notably in the case of in vitro gene transfer into cell types present with low abundance in cell mixtures and in approaches for in vivo gene transfer into cells in a living animal body. In a particular embodiment, the particles or vehicles provided by the invention, including but not limited to adenoviral or retroviral gene transfer vehicles, are used for gene therapy using hCAT1 mediated gene transfer into cells including but not limited to mammalian smooth muscle cells or hemopoietic stem cells such as CD34+CD38- or CD34+(CD33CD38CD71)- cells.

[0015] The invention also provides a method for selecting a filamentous phage expressing a protein capable of binding to a ligand, comprising constructing a phage library and enriching said library for phages having desired binding characteristics by at least one round of selection of phages for their capacity to bind to a synthetic peptide derived from said ligand, further comprising enriching said library for phages having desired binding characteristics by at least one round of selection of phages for their capacity to bind to a cell expressing said ligand.

[0016] The invention for example provides a peptide phage display to select hCAT1 binding peptides for incorporation in a ligand. To isolate peptides that bind to the third extracellular domain of hCAT1 we employed peptide phage display. A 12 mer peptide phage display library was purchased from New England Biolabs. This library is constructed in the filamentous E. coli phage M13 and the peptide sequences are displayed as N-terminal fusion proteins with the minor coat protein pIII. The unamplified library had a complexity of 1.9 x 10^9 different sequences as determined by the suppliers. We amplified the library once before using it to select hCAT1 binding peptide phages. Two targets were used to select for peptide displaying phages which bind to the third extracellular domain of hCAT1. First, the predicted third extracellular domain of hCAT1 was synthesised as a synthetic peptide by Neosystem, Strasbourg, France. The N-terminus of this peptide was biotinylated and followed by three amino acid linker residues KRR, followed by the predicted sequence of the third extracellular domain. Secondly, we generated cell lines derived from the human 911 cell line that overexpress hCAT1 as judged by steady state mRNA expression levels. The hCAT1 expression construct hATRCCI, which is a pcDNA3 based expression construct of the hCAT1 cDNA, was employed to transfect 911 cell lines followed by selection for neomycin resistance. A cloned cell line designated k08 was isolated which expresses high levels of hATRCCI derived hCAT1 mRNA.
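
The scale of the library and the layout of the synthetic target peptide can be illustrated with a small calculation. This is a sketch only: it assumes that each of the 12 positions can hold any of the 20 standard amino acids, and it takes the predicted third extracellular domain sequence from Example 1 of this description, read with the line-break hyphen removed. The variable and function names are illustrative.

```python
AMINO_ACIDS = 20             # assumption: all 20 standard residues allowed per position
PEPTIDE_LENGTH = 12          # 12 mer library
LIBRARY_COMPLEXITY = 1.9e9   # supplier's figure quoted in the text

# Number of distinct 12 mers that could exist in principle.
theoretical = AMINO_ACIDS ** PEPTIDE_LENGTH
# Fraction of that sequence space present in the unamplified library.
coverage = LIBRARY_COMPLEXITY / theoretical

LINKER = "KRR"
# Predicted third extracellular domain of hCAT1, as given in Example 1.
THIRD_DOMAIN = "KNWQLTEEDFGNTSGRLCLNNDTKEGKPGVGGF"


def target_peptide(linker=LINKER, domain=THIRD_DOMAIN):
    """Amino acid part of the synthetic target: linker, then domain.

    The N-terminal biotin is a chemical modification, not a residue,
    so it is not represented in the string.
    """
    return linker + domain


if __name__ == "__main__":
    print(f"theoretical 12 mers: {theoretical:.3e}")
    print(f"fraction sampled by the library: {coverage:.2e}")
    print(f"target peptide: biotin-{target_peptide()} ({len(target_peptide())} residues)")
```

Under these assumptions the library samples only a very small fraction of all possible 12 mers, which is one reason selection is carried out over enrichment rounds rather than by exhaustive screening.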

[0017] The invention is further described in the experimental part of this description, which does not limit the invention thereto.

Experimental part

[0018] Retroviruses are RNA viruses which efficiently integrate their genetic information into the genomic DNA of infected cells via a reverse-transcribed DNA intermediate. This property of their life-cycle and the fact that parts of their genetic material can be replaced by foreign DNA sequences make retroviruses one of the most promising vectors for the delivery of genes in human gene therapy procedures, most notably for gene therapies which rely on gene transfer into dividing tissues. Most retroviral vector systems are based on mouse retroviruses and consist of two components, i.e. (i) the recombinant retroviral vector carrying the foreign sequences of interest, and (ii) so-called packaging cells expressing the structural viral proteins of which the encoding sequences are lacking in the retroviral vector. Expression of (i) in (ii) results in the production of recombinant retroviral particles capable of transducing susceptible target cells.

[0019] The infectivity and host cell range of the retrovirus particle are conferred by an envelope glycoprotein which specifically binds to a receptor molecule on the target cell membrane. The envelope glycoprotein of all known retroviruses consists of two associated peptides, which are derived by proteolytic cleavage from the same precursor protein encoded by the retroviral envelope (env) gene (Gunzburg and Salmons, 1996; Weiss, 1996). The amino terminal domain encompasses specific binding site(s) for its receptor on the target cell membrane, determining the virus host range. Within this domain of about 200 amino acids, highly conserved sequences are present that are interrupted by two segments designated VRA and VRB, which vary in sequence and length among various mammalian type C retroviruses (Battini et al., 1992). The carboxy terminal peptide, which contains trans-membrane anchor sequences, is assumed to account for the selective uptake of the envelope glycoprotein in the virus particle and to mediate fusion between the virus membrane and, depending on the type of virus, the plasma membrane or intracellular vesicle membrane of the target cell (Januszeski et al., 1997; Thomas et al., 1997). In figure 1 a schematic representation of the structure of MuLV env protein is given. Several envelope glycoprotein variants with different infection spectra for mammalian cells have been identified (Battini et al., 1992).

[0020] There are examples of recombinant viruses carrying an amphotropic or GaLV envelope. Recombinant viruses carrying an amphotropic or GaLV envelope are capable of infecting human and murine cells and are commonly used in gene transfer trials including human gene therapy involving the pluripotent hemopoietic stem cell (PHSC) (Havenga et al., 1997). Gene transfer frequencies into PHSCs of human patients and non-human primate animal models have been shown to be extremely low and limit therapeutic stem cell gene therapy (Havenga et al., 1997; Hoogerbrugge et al., 1996; Van Beusechem et al., 1993; van Beusechem et al., 1992).

[0021] One important limiting factor has been shown to be low expression levels of retroviral receptors such as the one mediating entry of amphotropic MuLV retrovirus (GLVR2) (Orlic et al., 1996; van Es et al., 1996). The quiescent state of PHSCs when isolated for ex vivo gene transfer procedures poses another blockade (Knaan-Shanzer et al., 1996). Murine stem cell gene therapy experiments have traditionally been performed with ecotropic MuLV vectors (Havenga et al., 1997). Recombinant viruses carrying an ecotropic envelope are only capable of infecting murine cells. Transfer of genes into murine PHSCs using ecotropic retroviral vectors has been shown to result in high transduction efficiencies in circulating PHSC derived peripheral blood cells (PBL). These transduction efficiencies, with levels of PHSC gene transfer varying between 30 and 80%, would be high enough to be therapeutic if achieved in human PHSCs.
[0022] A small number of studies have been performed in which the transduction efficiencies of ecotropic and amphotropic retroviruses into murine PHSCs were compared directly (Havenga et al., 1997). One of these studies indicated that infection with amphotropic retrovirus resulted in expression, and thus transgene presence, for less than 8 weeks, whereas infection with ecotropic virus resulted in expression for more than 44 weeks after transplantation (Demarquoy, 1993). In a similar study, ecotropic virus was shown to be approximately 50 fold more efficient in transducing murine PHSCs as compared to amphotropic retrovirus (Orlic et al., 1996).

[0023] Ecotropic and amphotropic retrovirus differ in the receptor that is employed for virus entry (Albritton et al., 1989; van Zeijl et al., 1994). Ecotropic virus binds target cells via the ecotropic receptor mCAT1, which is a transporter of cationic L-amino acids (Kim et al., 1991), and amphotropic retrovirus binds target cells via the amphotropic receptor GLVR2, a sodium dependent phosphate transporter (Kavanaugh et al., 1994; Miller and Miller, 1994; van Zeijl et al., 1994).

[0024] A comparative study measuring mRNA levels of both the ecotropic and amphotropic receptors in mouse PHSCs (lin- c-kit bright) revealed an important difference. This study demonstrated that ecotropic receptor (mCAT1) mRNA levels in these cells are high whereas amphotropic receptor (GLVR2) mRNA levels were undetectable by RT-PCR (Orlic et al., 1996). GLVR2 expression studies on CD34+(CD38,CD33,CD71)- (CD34+lin-) cells isolated from human bone marrow, umbilical cord blood and mobilised peripheral blood support these data (van Es et al., 1996).
[0025] Another important factor which plays a role in determining successful retroviral entry and integration is the postbinding route of entry of a retrovirus particle. The postbinding entry route for ecotropic virus is different from that of amphotropic retrovirus. Ecotropic retrovirus transductions are sensitive to lysosomotropic agents such as chloroquine and NH4Cl. This suggests that upon binding of the ecotropic retrovirus, the retrovirus is internalised by receptor mediated endocytosis (McClure et al., 1990). In contrast, upon binding of the envelope of amphotropic retrovirus the viral envelope directly fuses with the plasma membrane. This process is not disrupted by lysosomotropic agents, suggesting that the postbinding steps of amphotropic MuLV virus are essentially different from those of ecotropic MuLV retrovirus (McClure et al., 1990).

[0026] The human homologue of the murine ecotropic virus receptor mCAT1 is hCAT1. Like mCAT1 mRNA expression in mouse PHSCs, hCAT1 mRNA is expressed at high levels in human PHSCs (Orlic et al., 1996). For both mCAT1 and hCAT1 the normal function is the import of cationic amino acids such as lysine and arginine (Albritton et al., 1993; Malhotra et al., 1996). The third predicted extracellular domain of mCAT1 includes the sequence YGE. These residues are crucial for receptor function. In hCAT1, which is nonfunctional as an ecotropic receptor, the corresponding sequence of the third extracellular domain is PGV. Mutation of the human sequence into one or two of the residues of mCAT1 results in an hCAT1 protein with ecotropic receptor function (Albritton et al., 1993; Yoshimoto et al., 1993). See also figure 2.
[0027] A number of mutant ecotropic envelope molecules have been described in the literature. MacKrell et al have mutated amino acids within the receptor-binding domain VRA of ecotropic MuLV envelope in order to identify residues involved in receptor binding. Virions incorporating mutant envelopes carrying mutations at amino acid residue D84 have lost their binding capabilities to the ecotropic receptor mCAT1 (MacKrell et al., 1996). Virions carrying D84 mutated envelope protein were tested on human cells to search for a possible change in receptor recognition specificity but were found not to infect human cells (Mike Januszeski, personal communication).

Skov and Andersen have studied ecotropic Moloney envelope interactions with mCAT1 by generation of mutant envelope molecules with mutated arginine and lysine residues in gp70 including VRA, followed by introduction in a replication competent retroviral backbone (Skov and Andersen, 1993). Mutations R135G, K137Q, R157G and R159A (R102G, K104Q, R124G and R126A without signal peptide, respectively) resulted in virions that were not able to replicate.
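
The two numbering schemes quoted above differ by a constant offset. Comparing each pair (R135G and R102G, and so on) gives a difference of 33 positions, which is taken here as the signal peptide length; that reading is an inference from the quoted pairs, not a figure stated in the text. A sketch of the conversion, with an illustrative function name:

```python
import re

# Inferred from the quoted pairs: 135 - 102 = 137 - 104 = 157 - 124 = 159 - 126 = 33.
SIGNAL_PEPTIDE_OFFSET = 33


def to_mature_numbering(mutation, offset=SIGNAL_PEPTIDE_OFFSET):
    """Convert a substitution such as 'R135G' from precursor numbering
    to numbering without the signal peptide."""
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if match is None:
        raise ValueError(f"not a single-residue substitution: {mutation!r}")
    wild_type, position, mutant = match.groups()
    return f"{wild_type}{int(position) - offset}{mutant}"


if __name__ == "__main__":
    for name in ("R135G", "K137Q", "R157G", "R159A"):
        print(name, "->", to_mature_numbering(name))
```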

Kingsman et al have described in PCT application WO96/31602 an insertion site in the VRA domain of ecotropic enve- 
lope which allows modification of the tropism. An integrin binding sequence was inserted resulting in infection of human 
cells expressing the respective integrin. 

[0028] PVC-211 murine leukemia virus (MuLV) is a neuropathogenic variant of ecotropic Friend MuLV (F-MuLV) that causes a rapidly progressive neurodegenerative disease in susceptible rodents. PVC-211 MuLV, but not the parental F-MuLV, can infect rat brain capillary endothelial cells (BCEC) efficiently, and the major determinant for BCEC tropism of PVC-211 MuLV is localized within the env gene. More specific analysis indicated that E116G and E129K substitutions in the background of the F-MuLV envelope protein were sufficient for conferring BCEC tropism on the virus (Masuda et al., 1996a). Host range changes of these mutations were found to include CHO cells, which are normally not infectable with ecotropic F-MuLV or M-MuLV. The latter suggests that these mutations overcome a negative effect of glycosylation of the CHO cell CAT1 receptor in the region of virus binding in the third extracellular domain (Masuda et al., 1996b).
[0029] By employing particular natural env variants the transduction spectrum can be limited to some extent, but true specificity for human target cells of interest cannot be obtained following this strategy (Masuda et al., 1996a; von Kalle et al., 1994; Wilson et al., 1994).
[0030] In the present invention we describe the expansion of the host range of an ecotropic retrovirus or other gene transfer vehicle such as an adenoviral vector, resulting in increased transduction of hemopoietic stem cells. In this invention, targeted delivery is accomplished by directing the retrovirus particle to cell membrane molecules differing from the natural receptor. This could then lead to increased specificity of transduction.

[0031] The present invention discloses examples of molecules that bind to hCAT1 and that can be used to develop gene transfer vehicles such as retroviral and adenoviral vectors. In particular, the invention relates to proteins and derivatives thereof expressed in the lipid bilayer of enveloped virus particles such as retroviruses. Methods, materials, procedures and pharmaceutical formulations for the design and preparation of the above molecules and virus particles are also part of the invention. These molecules and virus particles have applications in the fields of virology, gene therapy, biochemistry and molecular biology.
[0032] The present invention relates to peptide molecules binding to hCAT1. These molecules are characterized by their ability to bind the third extracellular domain of the cationic amino acid transporter hCAT1, either by binding to a synthetic peptide encompassing this third extracellular domain or by binding to cells expressing this domain of hCAT1 protein on their extracellular cell surface. These hCAT1 binding molecules can be peptides or antibody fragments displayed on a filamentous phage or as free molecules.
[0033] Included in the present invention are filamentous phages that display hCAT1 binding molecules and that can be used to transfer genes into cells by modification of the phage genome using techniques known in the art.
[0034] Included in the present invention is the use of hCAT1 binding molecules to design vectors that employ hCAT1 to enter a HSC or any other cell expressing hCAT1. hCAT1 binding molecules can be incorporated in the envelope of a retrovirus or the capsid of any other viral or non-viral gene transfer vehicle such as an adenoviral vector. Incorporation of these hCAT1 binding sequences can be done using techniques known in the art.

[0035] Preferred are mutant retroviral envelopes that are derived from wild-type ecotropic envelope and which employ hCAT1 to enter the human or primate cell by binding to hCAT1. These new retroviral envelope molecules, when incorporated in a retroviral virion, will be able to infect hCAT1 positive cells such as human PHSCs at high efficiencies. The mutant retroviral envelopes can be used to pseudotype recombinant type C retrovirus including but not limited to murine leukemia retroviral vectors. In a further embodiment of the present invention these hCAT1 binding envelopes can also be used to pseudotype lentiviral vectors including equine or HIV derived lentiviral vectors (Kim et al., 1998; Rizvi and Panganiban, 1992), (Kafri et al., 1997; Poeschla et al., 1996), (Miyoshi et al., 1997; Naldini et al., 1996b).

[0036] Any hCAT1 binding envelope molecules or parts thereof made according to the methods described herein or other methods can be ligated into full length mammalian retroviral envelope expression constructs and introduced in cell lines expressing and containing all the sequences necessary for the generation of infectious and functional retroviral particles, including but not limited to cell lines preferably derived from the adenoviral E1 transformed human cell line PER.C6 (WO97/00326) that express murine leukemia gag-pol constructs and a retroviral vector containing long terminal repeats (LTRs) and retroviral RNA packaging signals, such as those vectors described in WO96/35798. The hCAT1 binding envelopes made according to the subject matter of this invention can also be used to pseudotype vectors other than murine leukemia retroviral vectors, including but not limited to lentiviral vectors (Naldini et al., 1996a; Naldini et al., 1996b).

[0037] In a further embodiment of the present invention, hCAT1 binding sequences can also be incorporated in the capsid proteins of adenovirus, including but not limited to the HI loop of the knob domain of an adenovirus (Krasnykh et al., 1998), preferably an adenovirus which does not bind to the adenoviral receptor CAR1 or MHC1. This results in an adenovirus that enters cells through hCAT1. As deduced from the absence of mCAT1 expression in mouse liver (Closs et al., 1993), an hCAT1 binding adenovirus does not exhibit liver transduction when administered in vivo. By combining an hCAT1 targeted knob with a ligand for another in vivo target, hCAT1 targeting of an adenovirus can remove an important limitation of in vivo use of adenoviral vectors for gene therapy (Sullivan et al., 1997). In another embodiment an hCAT1 targeted adenovirus will more efficiently transduce cells that are difficult to transduce, such as endothelial cells or smooth muscle cells, as compared to a wildtype adenoviral vector including but not limited to an adenoviral vector derived from the adenoviral serotype 5.

[0038] An hCAT1 targeted adenovirus is useful for local applications of adenoviral vector such as in patients with restenosis following balloon angioplasty where smooth muscle cells need to be transduced with for example an adenoviral vector carrying the ceNOS cDNA. More efficient transduction of these tissues allows lower multiplicities of infection (MOIs) to be used and therefore results in less vector associated toxicity to the tissues surrounding the target cells (PCT/EP98/00723).

[0039] In another aspect of the present invention, the hCAT1 binding human Fabs that are part of the subject matter of this invention can be used to measure expression of hCAT1 molecules on cells that are targets for gene therapy using hCAT1 mediated gene transfer, including but not limited to mammalian hemopoietic stem cells such as CD34+CD38- or CD34+(CD33CD38CD71)- cells. This could be part of a procedure aimed at determining when or whether a patient's cells are most susceptible to gene transfer through hCAT1 with gene transfer vehicles including but not limited to adenoviral or retroviral vehicles.

[0040] The skilled artisan will be able to apply the teaching of the present invention to other virus capsid or envelope or non-viral gene transfer molecules or vehicles than those exemplified herein without departing from the present invention, and therefore the examples presented are illustrations and not limitations. It is intended that all such other examples be included within the scope of the appended claims.

Example 1. Sequences of hCAT1 cDNAs amplified from human CD34+ cells.

[0041] For the purpose of developing gene transfer tools that enter PHSCs through hCAT1, specifically through binding
to the third extracellular domain, we isolated total RNA from a number of different human CD34+ samples and
determined the cDNA sequence of hCAT1 (see figure 2). Total RNA was isolated according to the protocol described by
Chomczynski and Sacchi (Chomczynski and Sacchi, 1987). RT-PCR was performed by using the Superscript
Preamplification System for First Strand cDNA Synthesis (Life Technologies). For first strand synthesis random
hexamers were used. The hCAT1 cDNA was amplified with two sets of primers, each resulting in a product of
approximately 1 kb encompassing the open reading frame of the hCAT1 mRNA (Yoshimoto et al., 1991). DNA sequencing
was performed by BaseClear, Leiden, The Netherlands using automated sequence analysis. In figures 3a and 3b the
results of sequence analysis of hCAT1 cDNA isolated from CD34+ cells from mobilized peripheral blood or umbilical
cord blood are compiled. From these nucleotide sequence analyses (figure 3b) it can clearly be deduced that hCAT1 is
indeed expressed in the CD34+ samples tested and includes the third extracellular domain with predicted sequence
KNWQLTEEDFGNTSGRLCLNNDTKEGKPGVGGF, which includes the sequence PGV determining function as receptor (see
above). Therefore targeting through this domain or part of this domain of hCAT1 in hemopoietic CD34+ cells, including
but not limited to hemopoietic stem cells such as defined by lineage negative phenotypes, e.g. CD34+CD38-, is possible.
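
By way of illustration only, the consistency between the one-letter sequence quoted above and SEQ ID NO: 1 of the sequence listing can be checked with a few lines of Python. The sketch below converts the three-letter listing into one-letter code and locates the PGV motif; the helper names `to_one_letter` and `motif_positions` are hypothetical and not part of the original disclosure.

```python
# Residues of SEQ ID NO: 1 in three-letter code, as given in the sequence listing.
SEQ_ID_1 = (
    "Lys Asn Trp Gln Leu Thr Glu Glu Asp Phe Gly Asn Thr Ser Gly Arg "
    "Leu Cys Leu Asn Asn Asp Thr Lys Glu Gly Lys Pro Gly Val Gly Gly Phe"
)

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(three_letter: str) -> str:
    """Convert a space-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[residue] for residue in three_letter.split())


def motif_positions(sequence: str, motif: str) -> list[int]:
    """Return the 1-based start position of every occurrence of motif."""
    width = len(motif)
    return [
        i + 1
        for i in range(len(sequence) - width + 1)
        if sequence[i:i + width] == motif
    ]


domain = to_one_letter(SEQ_ID_1)
print(domain, len(domain), motif_positions(domain, "PGV"))
```

The position reported for PGV is relative to the first residue of the 33 amino acid domain, not to the full-length hCAT1 protein.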

Example 2. Peptide phage display to select hCAT1 binding peptides. 


[0042] To isolate peptides that bind to the third extracellular domain of hCAT1 (Albritton et al., 1993) (figure 2) we
employed peptide phage display. A 12-mer peptide phage display library was purchased from New England Biolabs.
This library is constructed in the filamentous E. coli phage M13 and the peptide sequences are displayed as N-terminal
fusion proteins with the minor coat protein pIII. The unamplified library had a complexity of 1.9 x 10^9 different
sequences as determined by the supplier. We amplified the library once before using it to select hCAT1 binding peptide
phages. Two targets were used to select for peptide displaying phages which bind to the third extracellular domain of
hCAT1. First, the predicted third extracellular domain of hCAT1 was synthesised as a peptide by Neosystem,
Strasbourg, France. The N-terminus of this peptide was biotinylated and followed by three amino acid linker residues
KRR, followed by the predicted sequence of the third extracellular domain (figure 4). Secondly, we generated cell lines
derived from the human 911 cell line that overexpress hCAT1 as judged by steady state mRNA expression levels. The
hCAT1 expression construct hATRCCI, which is a pcDNA3 based expression construct of the hCAT1 cDNA (Malhotra et
al., 1996), was employed to transfect 911 cells followed by selection for neomycin resistance in 1 mg/ml of G418
(Geneticin, Life Technologies, Inc.). A cloned cell line designated k08 was isolated which expresses high levels of
hATRCCI derived hCAT1 mRNA (figure 5).

[0043] To select for peptide displaying phages that bind to the putative third extracellular domain of hCAT1 as
expressed on human cells the following selection procedure was employed: six rounds of selection on biotinylated
hCAT1 peptide (figure 4) followed by three rounds of selection on hCAT1 overexpressing k08 cells. Initially two separate
selections were carried out differing in the stringency of washing. Low stringency washing consisted of 3 washes with
2 % (w/v) milk powder in PBS with 0.05 % (v/v) Tween 20 and 3 washes with PBS. High stringency washing consisted
of 5 washes with 2 % (w/v) milk powder in PBS with 0.05 % Tween 20, 5 washes with PBS with 0.05 % Tween 20 and 5
washes with PBS. After 1 round of selection on 911-hCAT1-k08 cells, eluted phages from both washing procedures
were pooled and used for a second and third round of selection on 911-hCAT1-k08 cells. The results of these
experiments are depicted in table 1. Clearly the ratio of output over input increases upon selection on hCAT1 peptide,
indicative of selection for binding phages. When selection on hCAT1 positive cells was started the ratio dropped, and it
slightly increased in the last round on hCAT1 expressing human cells.
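
As a non-limiting sketch, the output/input ratios discussed above can be recomputed from the phage numbers of the stringent selection in table 1. The function names `recovery_ratios` and `fold_enrichment` are hypothetical helpers introduced here for illustration; only the input and output numbers are taken from the table.

```python
# (selection, input phages, output phages) for the six stringent peptide
# rounds, copied from table 1.
STRINGENT_PEPTIDE_ROUNDS = [
    ("500 nM peptide", 1.4e10, 0.6e5),
    ("500 nM peptide", 3.8e9, 1.1e6),
    ("500 nM peptide", 7.6e7, 3e4),
    ("500 nM peptide", 2.1e8, 6e5),
    ("100 nM peptide", 5.3e10, 2.4e9),
    ("100 nM peptide", 1.2e11, 1.8e10),
]


def recovery_ratios(rounds):
    """Output/input ratio for each selection round."""
    return [output / input_ for _, input_, output in rounds]


def fold_enrichment(ratios):
    """Ratio of the last round relative to the first round."""
    return ratios[-1] / ratios[0]


ratios = recovery_ratios(STRINGENT_PEPTIDE_ROUNDS)
for (label, _, _), ratio in zip(STRINGENT_PEPTIDE_ROUNDS, ratios):
    print(f"{label}: {ratio:.1e}")
print(f"fold increase, round 6 over round 1: {fold_enrichment(ratios):.0f}")
```

A ratio that rises from round to round, as it does here, is the pattern expected when binding phages are being enriched.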

[0044] After the last round of selection with the hCAT1 peptide and after each round of cell selection the pools of
peptide displaying phages were tested for binding to immobilized hCAT1 peptide using an Enzyme Linked
Immunosorbent Assay (ELISA). 96-well plates were coated with 2 mg/ml biotinylated BSA in PBS and incubated for 1
hour at 37 °C, after which the wells were rinsed 3 times for 5 minutes with PBS/0.05 % Tween 20. Then the wells were
saturated with streptavidin (10 mg/ml in PBS/0.5 % gelatin) for 1 hour at room temperature (RT) and washed 3 times
with PBS/0.05 % Tween 20. Then the wells were incubated overnight at 4 °C with biotinylated hCAT1 peptide (figure 4)
at a concentration of 10 mg/ml in PBS. The next day the wells were rinsed two times with PBS/0.1 % Tween 20 and two
times with PBS. Then the wells were blocked with 2 % non-fat milk powder in PBS for at least 30 minutes at RT followed
by three rinses with PBS/0.1 % Tween 20 and three with PBS. Subsequently equal volumes of 4 % non-fat milk powder
in PBS and of culture supernatant or purified (PEG precipitated) phage were added to all wells and incubated for 1.5
hours at RT. After this incubation the wells were washed three times with PBS/0.1 % Tween 20 and three times with
PBS followed by incubation with an anti-M13 antibody solution (Pharmacia, 1:5000 in 2 % non-fat milk powder in PBS)
for 1 hour at RT. Again the wells were washed three times with PBS/0.1 % Tween 20 and three times with PBS followed
by the addition of a rabbit anti-goat HRP conjugate solution (BioRad, 1:2000 in 2 % non-fat milk powder in PBS) for 1
hour at RT. After this incubation the wells were washed again three times with PBS/0.1 % Tween 20 and three times
with PBS. Phage binding was then visualized using TMB colour solution (0.1 mg/ml TMB, 1 % DMSO, 1x TMB buffer,
0.001 % 30 % H2O2 in H2O) for 20-30 min in the dark at RT, stopped with 2 N H2SO4 and read at 450 nm in a microplate
reader. Using this hCAT1 specific ELISA an enrichment of phages binding to hCAT1 peptide is observed (figure 6).
Importantly, after binding of the peptide selected pools to hCAT1 overexpressing cells, eluted phages still bind to hCAT1
peptide. Clones from round 3 on hCAT1 overexpressing cells were isolated and tested in the hCAT1 peptide ELISA
(figure 7). Except for one, all tested clones bound to hCAT1 peptide and thus to the third extracellular domain displayed
on human cells.
[0045] To confirm enrichment for specific sequences and to determine the amino acid sequence of the 12-mer peptides
displayed, we isolated single stranded M13 phage DNA for automated sequence analysis (BaseClear, Leiden, The
Netherlands). The oligonucleotide used for sequencing was 5'-CCCTCATAGTTAGCGTAACG-3'. We sequenced clones
isolated from the pools of various peptide and cell selections. For this purpose we pooled the eluates of the two different
washing conditions. In addition to clones from the amplified 12-mer peptide library, we sequenced clones only from
peptide rounds 3, 5 and 6 and cell rounds 1, 2 and 3. In table 2 the sequences determined for the various clones are
given. Clearly a very strong selection occurred, because all cell selected phage clones displayed one sequence, namely
SVSVGMKPSPRP. This sequence is also displayed by phages in hCAT1 peptide selected pool 6 in a mixture with 3
other sequences. These other phages are lost once the phage pools selected on hCAT1 peptide are selected for binding
to hCAT1 overexpressing cells.
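
The enrichment of SVSVGMKPSPRP described above can be tabulated as follows. This is a minimal sketch: the clone counts are transcribed from table 2, with the second round 3 sequence spelled according to SEQ ID NO: 12, and the function `clone_fraction` is a hypothetical helper.

```python
# Number of identical clones per sequenced pool, transcribed from table 2.
# Keys are (target, round); pools marked ND in the table are omitted.
CLONE_COUNTS = {
    ("peptide", 3): {"TLNNHTTPPAWN": 1, "QVVHSPFPTSRP": 1},
    ("peptide", 5): {"FEQHNWWDSHPQ": 1, "NTFDLWLQSVPQ": 7},
    ("peptide", 6): {
        "FEGCHPQSGLSC": 1,
        "FEQHNWWDSHPQ": 1,
        "NTFDLWLQSVPQ": 5,
        "SVSVGMKPSPRP": 4,
    },
    ("cells", 1): {"SVSVGMKPSPRP": 4},
    ("cells", 2): {"SVSVGMKPSPRP": 4},
    ("cells", 3): {"SVSVGMKPSPRP": 23},
}


def clone_fraction(pool, sequence):
    """Fraction of sequenced clones in a pool that display the given peptide."""
    counts = CLONE_COUNTS[pool]
    return counts.get(sequence, 0) / sum(counts.values())


for pool in CLONE_COUNTS:
    share = clone_fraction(pool, "SVSVGMKPSPRP")
    print(f"{pool[0]} round {pool[1]}: {share:.2f}")
```

With these counts the sequence is absent from the sequenced clones of peptide rounds 3 and 5, represents 4 of 11 clones in peptide round 6 and all clones after selection on cells.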

[0046] The cloned SVSVGMKPSPRP displaying M13 phage was used in experiments to measure binding of the displayed
sequence to cells that express hCAT1. First we performed a flow cytometry experiment with the 2 cell lines
911-pcDNA3 and 911-hCAT1 (k08). Cells were incubated with 10^11 phage in 100 µl PBS/0.1 % BSA for 1 hour at room
temperature. Subsequently the cells were washed twice with PBS/0.1 % BSA followed by incubation of the cells with
anti-M13 antibody (Pharmacia, 1:500 in PBS/0.1 % BSA) for 30 min at room temperature and washed twice with
PBS/0.1 % BSA. Then the cells were incubated with rabbit anti-goat FITC (DAKO, 1:50 in PBS/0.1 % BSA) for 30 min
at RT and washed twice with PBS/0.1 % BSA. Binding of phage was then measured in the FL1 channel of a Becton
Dickinson flow cytometer. As a control we used an identical amount of phage from the amplified 12-mer library. In figure
8 the results of this experiment are depicted. Clone #26 phage binds to 911 cells and in particular to 911 cells that
overexpress hCAT1.

[0047] We also measured cell binding of phage by incubating hCAT1 expressing cells with phage followed by titering
total cell bound phage, eluted phage and cell associated phage fractions on E. coli using a standard M13 plating assay
on a lawn of E. coli cells. For this purpose E. coli strain ER2537 was grown overnight in LB medium. This overnight
culture was then used to inoculate 20 ml of fresh LB medium at an OD600nm of 0.05. Once the culture had reached an
OD600nm of 0.5, 500 µl of the ER2537 E. coli bacteria were mixed with 500 µl dilutions of phage samples and incubated
at RT for 10 min. Plating on a standard LB-agar plate was performed by mixing 3 ml top agar with 200 µl of each
sample. Once the top agar had solidified the plates were transferred upside down to a 37 °C incubator for 12-14 hours.
Plaques were counted and used to determine the number of phage particles binding to hCAT1 expressing cells. In table
3 cell binding is depicted.
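
The conversion from plaque counts to phage titers is not spelled out above; one possible way to perform it is sketched below. The sketch assumes that the 200 µl plated is taken from the 1 ml mixture of 500 µl bacteria and 500 µl phage dilution, so that each plate carries 100 µl of the phage dilution. The function name `pfu_per_ml` and its parameters are hypothetical.

```python
def pfu_per_ml(plaques, dilution, phage_volume_ml=0.5,
               bacteria_volume_ml=0.5, plated_volume_ml=0.2):
    """Estimate plaque forming units per ml of the undiluted phage sample.

    dilution is the fraction of the original sample present in the phage
    dilution, e.g. 1e-6 for a million-fold dilution. The volume defaults
    follow the plating protocol in the text; how they combine is an
    assumption of this sketch.
    """
    mix_volume_ml = phage_volume_ml + bacteria_volume_ml
    # Volume of phage dilution that actually ends up on the plate.
    phage_dilution_plated_ml = plated_volume_ml * phage_volume_ml / mix_volume_ml
    return plaques / (dilution * phage_dilution_plated_ml)


# Hypothetical count: 50 plaques on the plate of a million-fold dilution.
titer = pfu_per_ml(plaques=50, dilution=1e-6)
print(f"{titer:.1e} pfu/ml")
```

If a different volume of phage dilution reaches the plate, only the keyword arguments need to be changed.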
[0048] From these results we can clearly conclude that the 12-mer peptide displaying phage with sequence
SVSVGMKPSPRP indeed binds to hCAT1 expressing cells. hCAT1 expressing cells were also incubated with phage at
37 °C, followed by elution of bound phages, after which cell associated phages were liberated. Both fractions were used
in phage titering on E. coli and clearly a cell associated fraction is detected. This suggests that phages which display
sequence SVSVGMKPSPRP and bind to hCAT1 also enter a human hCAT1 expressing cell. This feature of sequence
SVSVGMKPSPRP could be used for the development of gene transfer products useful in gene therapy.
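
For illustration, the plaque counts of table 3 can be compared numerically. The sketch below reads the "Lysis" rows of table 3 as the cell associated fraction recovered after elution, which is an interpretation rather than a statement of the original text; the function names are hypothetical.

```python
# Plaque counts (pfu x 1000) transcribed from table 3, 37 °C incubation.
TABLE_3 = {
    "911-hCAT1": {
        "elution": {"clone26": 120.0, "library": 4.32},
        "lysis": {"clone26": 72.0, "library": 18.72},
    },
    "911-pcDNA3": {
        "elution": {"clone26": 55.68, "library": 3.84},
        "lysis": {"clone26": 47.52, "library": 8.64},
    },
}


def cell_associated_fraction(cell_line, phage):
    """Share of recovered phage found in the lysis fraction after elution."""
    eluted = TABLE_3[cell_line]["elution"][phage]
    lysed = TABLE_3[cell_line]["lysis"][phage]
    return lysed / (eluted + lysed)


def clone_over_library(cell_line, rescue):
    """Recovery of clone #26 relative to the unselected 12-mer library."""
    row = TABLE_3[cell_line][rescue]
    return row["clone26"] / row["library"]


for line in TABLE_3:
    print(
        line,
        f"cell associated: {cell_associated_fraction(line, 'clone26'):.2f}",
        f"clone/library (elution): {clone_over_library(line, 'elution'):.1f}",
    )
```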

Example 3. Human FAb phage display to select hCAT1 binding human antibody molecules. 


[0049] To isolate antibodies that bind to the third extracellular domain of hCAT1 (Albritton et al., 1993) (figures 2, 4) we
employed phages displaying human FAb fragments encompassing the light and heavy variable and constant regions.
A human FAb phage display library was constructed in phage display vector pCES1, a vector derived from pCANTABS
(McGuiness et al., 1996). The library was constructed in the filamentous E. coli phage M13 and the FAb sequences are
displayed partly as N-terminal fusion proteins with the minor coat protein pIII. The unamplified library had a complexity
of approximately 3.3 x 10^10 different sequences. Two targets were used to select for FAb displaying phages which
bind to the third extracellular domain of hCAT1. First, the predicted third extracellular domain of hCAT1 was synthesized
as a peptide by Neosystem, Strasbourg, France. The N-terminus of this peptide was biotinylated and followed by three
amino acid linker residues KRR, followed by the predicted sequence of the third extracellular domain (figures 2, 4).
Secondly, we generated cell lines derived from the human 911 cell line that overexpress hCAT1 as judged by steady
state mRNA expression levels. The hCAT1 expression construct hATRCCI, which is a pcDNA3 based expression
construct of the hCAT1 cDNA, was employed to transfect 911 cells followed by selection for neomycin resistance in
1 mg/ml of G418 (Geneticin, Life Technologies, Inc.). A cloned cell line designated k08 was isolated which expresses
high levels of hATRCCI derived hCAT1 mRNA (figure 5).

[0050] To select for FAb displaying phages that bind to the putative third extracellular domain of hCAT1 as expressed
on human cells the following selection procedure was employed: four rounds of selection on biotinylated hCAT1 peptide
(figure 4) followed by three rounds of selection on hCAT1 overexpressing k08 cells. For selection on biotinylated
hCAT1 peptide, 250 µl of FAb library (or eluted phage from the previous round) was mixed with 250 µl 4 % Marvel in
PBS and equilibrated while rotating at RT for 1 hour. Subsequently biotinylated hCAT1 peptide (20-500 nM in H2O) was
added. This mix was incubated on the rotator at RT for 1 hour before 250 µl equilibrated streptavidin-dynabeads in 2 %
Marvel in PBS was added. After incubation on a rotator at RT for 15 min the beads with the bound phage were washed
5 times with PBS/2 % Marvel/0.1 % Tween, 5 times with PBS/0.1 % Tween and 5 times with PBS. Then the phage were
eluted by incubation with 0.1 M triethylamine on a rotator at RT for 10 min and neutralised in 1 M Tris-HCl pH 7.4. The
eluted phage were titered and amplified in TG1 before the next selection. For selection on 911-hCAT1 cells, the cells
were harvested at a confluency of about 80 % and suspended in PBS/10 % FBS/2 % Marvel to a final concentration of
at least 3 x 10^6 cells/ml. This cell suspension was incubated for 30 min on a rowing boat mixer (or rotator), while at the
same time phage were also preincubated in PBS/10 % FBS/2 % Marvel. Then the cells were centrifuged, resuspended
in the preincubated phage solution and incubated on a rowing boat mixer (or rotator) for 1 hour. Afterwards the cells
were washed 10 times with PBS/10 % FBS/2 % Marvel and twice with PBS. The cells were centrifuged and
resuspended in 0.6 ml water. Subsequently 0.6 ml 200 mM triethylamine was added (dropwise while vortexing). After 5
minutes the suspension was neutralised by adding 0.6 ml of 1 M Tris-HCl pH 7.4 (dropwise while vortexing). After
centrifugation (5 min, 14000 rpm) the supernatant was transferred to a new tube and titered and amplified in TG1 before
the next selection. The results of these experiments are depicted in table 4. Clearly the ratio of output over input
increases upon selection on hCAT1 peptide, indicative of selection for hCAT1 peptide binding phages. When selection
on hCAT1 positive cells was started the ratio dropped, and it slightly increased in the subsequent rounds on hCAT1
expressing cells.
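
As a rough, non-authoritative illustration of the decreasing peptide concentrations used in the four peptide rounds (500, 500, 100 and 20 nM according to table 4), the amount of biotinylated peptide per selection can be estimated. The sketch assumes a selection volume of 500 µl (250 µl phage plus 250 µl 4 % Marvel) and neglects the volume of the added peptide, which is not stated in the text; `peptide_pmol` is a hypothetical helper.

```python
# Peptide concentrations of the four peptide selection rounds (table 4), in nM.
PEPTIDE_ROUNDS_NM = [500, 500, 100, 20]


def peptide_pmol(concentration_nm, volume_ul=500.0):
    """Amount of peptide in pmol for a given concentration and volume.

    1 nM x 1 µl = 1e-9 mol/l x 1e-6 l = 1 fmol, so the product is divided
    by 1000 to obtain pmol. The 500 µl default is an assumption.
    """
    return concentration_nm * volume_ul / 1000.0


amounts = [peptide_pmol(c) for c in PEPTIDE_ROUNDS_NM]
for round_number, (conc, amount) in enumerate(zip(PEPTIDE_ROUNDS_NM, amounts), 1):
    print(f"round {round_number}: {conc} nM, about {amount:g} pmol peptide")
```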

[0051] The pools of the last 3 rounds were tested for binding to the biotinylated hCAT1 peptide in a hCAT1 specific
ELISA and also for cell binding by flow cytometric analysis (both protocols are described in example 2). After the last
round of selection on cells the pool of FAb phages still binds to the biotinylated hCAT1 peptide (figure 9). Flow
cytometric analysis showed that this pool also binds to hCAT1 overexpressing cells (figure 10). From this pool 43 clones
were analysed by fingerprint analysis and divided into 14 different groups. From each group 1 phage clone was tested
for binding to the biotinylated hCAT1 peptide in a hCAT1 specific ELISA and also for cell binding by FACS analysis.
Three clones appeared to be streptavidin binders whereas the other 11 clones showed binding to the biotinylated hCAT1
peptide (figure 11). Flow cytometric analysis revealed that only 1 of the 14 clones showed strong binding to hCAT1
overexpressing cells (figure 12). This clone was analyzed in more detail (figures 13, 14). Clearly clone #25 binds
strongly to the synthetic hCAT1 peptide used and to hCAT1 overexpressing 911 k08 cells. Moreover, the average
increase in binding of this phage to 911-hCAT1-k08 overexpressing cells over 911-pcDNA3 cells was found to be
1.6 ± 1.2 fold (figure 14). Double strand phagemid DNA was prepared and used to determine the nucleotide and
deduced amino acid sequence of the displayed variable heavy and light chains. For a schematic picture of the vector
pCES1, in which the library of variable chains was cloned, see figure 15. The hCAT1 binding domains are, as expected,
homologous to human immunoglobulin sequences. The complementarity determining regions (CDRs) are indicated in
figure 16.
The sequences of this immunoglobulin can be incorporated in viral or non-viral proteins that mediate binding and entry
to cells and thus create gene transfer vehicles that enter cells through hCAT1. The hCAT1 binding human FAbs can also
be used to measure expression of hCAT1 on cells that are targets for gene therapy using hCAT1 mediated gene
transfer.

Example 4. Incorporation of hCAT1 binding peptides in ecotropic retroviral envelope

[0052] To include hCAT1 binding peptides (see example 2) in the context of an ecotropic murine leukemia viral
envelope we used functional display of ecotropic envelope by filamentous phages. We used the construct gpIII/env2
which encodes a fusion protein consisting of a prokaryotic signal peptide and all of the gp70 protein including the
variable regions A and B and the polyproline hinge (amino acid residues 34-308) fused to the capsid protein encoded by
gene III of M13. Numbering of amino acid sequences was done according to the unprocessed envelope sequence as
deposited in the Swiss-Prot database with accession number P03385 and starting from the viral signal peptide. In table
5 all the oligonucleotides are depicted that are used for insertion of the peptide sequences in retroviral envelope.
[0053] Three sites and ways of peptide insertion have been chosen: (1) Insertion at the BstEII site of the ecotropic
envelope. (2) Replacement of sequence PFSS (residues 96-99) by each of the 4 peptides (see table 5). (3)
Replacement of sequence LTSLTP (residues 122-127) by each of the 4 peptide sequences (see table 5). The
sequences PFSS and LTSLTP are predicted to be displayed on the outside of the envelope protein as deduced from
the structure of crystallised Friend ecotropic envelope (Fass et al., 1997). For the BstEII insertion constructs two single
stranded complementary oligonucleotides were synthesised. At the amino acid sequence level linker amino acids
were included at the amino and carboxy terminus of the inserted peptide sequence. These single stranded
oligonucleotides were then mixed in equimolar amounts, heated to 95 °C and slowly cooled to room temperature to
allow hybridisation of the complementary molecules to double stranded DNA. Annealing was followed by BstEII
digestion and separation on a 2 % agarose gel run in TAE buffer. DNA was then excised from the gel and purified using
the Qiaquick gel extraction kit (Qiagen, Germany). At the same time double stranded phagemid DNA of construct
gpIII/env2 was digested and thus linearized with BstEII. Linearized gpIII/env2 DNA was subjected to an incubation with
the thermosensitive alkaline phosphatase TSAP (Life Technologies), then mixed in various molar ratios with double
stranded BstEII digested oligonucleotides encoding any of the 4 hCAT1 binding peptides (see table 5b). Then 1 unit of
T4 ligase and T4 ligase buffer supplemented with 1 mM ATP were added. The ligation mixture was incubated for 1 hour
at +20 °C. The ligation mixtures were then transformed into Max DH5α competent bacteria (Life Technologies).
Ampicillin resistant colonies were picked and subjected to a PCR with one of the 4 primers in table 5c and primer
Ecoenv12 (see table 5). This PCR allows one to determine the nature of the inserted sequence and its orientation.
Plasmid DNA of colonies with correct orientation of insert DNA was then isolated using Qiagen columns and sequenced
(BaseClear, Leiden) to confirm the complete sequence of the inserts and boundaries plus their orientation.
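
The replacement strategies (2) and (3) can be pictured at the amino acid level with the following sketch. It operates on a short toy sequence that merely contains the PFSS and LTSLTP motifs; the toy sequence and the function `replace_motif` are hypothetical and do not reproduce the envelope sequence of accession number P03385 or the oligonucleotides of table 5.

```python
HCAT1_BINDING_PEPTIDE = "SVSVGMKPSPRP"  # SEQ ID NO: 3

# Hypothetical toy sequence containing both motifs once; NOT the real envelope.
TOY_ENVELOPE = "MKTAPFSSGGHWLTSLTPWQ"


def replace_motif(envelope, motif, peptide):
    """Replace the single occurrence of motif in envelope by peptide."""
    if envelope.count(motif) != 1:
        raise ValueError(f"expected exactly one {motif!r} in the sequence")
    start = envelope.index(motif)
    return envelope[:start] + peptide + envelope[start + len(motif):]


pfss_variant = replace_motif(TOY_ENVELOPE, "PFSS", HCAT1_BINDING_PEPTIDE)
ltsltp_variant = replace_motif(TOY_ENVELOPE, "LTSLTP", HCAT1_BINDING_PEPTIDE)
print(pfss_variant)
print(ltsltp_variant)
```

Requiring exactly one occurrence of the motif guards against editing an unintended site when the same approach is applied to a full-length sequence.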

[0054] For the insertion/deletion of hCAT1 binding peptide sequences into gpIII/env2 at the LTSLTP and PFSS
positions two fragments were amplified (primary PCR) using Elongase polymerase and the following two pairs of
primers. Fragment 1: Ecoenv17 (sense primer, table 5c) plus one of the even numbered oligonucleotides in table 5a.
Fragment 2: Ecoenv12 or ecoenvOS (antisense, table 5c) plus an odd numbered primer in table 5a. At the DNA level
fragment 1 harbours the peptide insertion at its 3' end whereas fragment 2 harbours the peptide insertion at its 5' end.
Because the 3' end of fragment 1 and the 5' end of fragment 2 are identical, the two fragments can be used to assemble
a full double stranded DNA fragment encompassing the ecotropic envelope sequence between and including part of
the Ecoenv17 and Ecoenv12 oligonucleotide sequences. This is done by first purifying the two fragments from the
primary PCR using Qiaquick PCR purification columns to remove all remaining primers, followed by a PCR using the
two overlapping fragments, primers Ecoenv17 and Ecoenv12, and all the components necessary for DNA amplification
using Elongase. This step results in the assembly of a fragment harbouring the 12-mer hCAT1 binding peptide insertion
and in the deletion of the LTSLTP or PFSS sequence. These fragments are purified and digested with NotI and PinA1,
resulting in a DNA fragment of approximately 519 basepairs which was isolated from an agarose gel using a Qiagen
DNA isolation kit. The 519 basepair fragments were then ligated into a NotI and PinA1 digested gpIII/env2 Surfscript
fragment of approximately 4000 basepairs using T4 ligase as described above. E. coli bacteria are then transformed
with the ligation mixture, ampicillin resistant colonies picked, plasmid DNA isolated and analyzed for the presence of
519 basepair inserts using NotI and PinA1 restriction enzymes and DNA agarose gel electrophoresis. Plasmids with
appropriate inserts were then further verified by automated DNA sequencing of the inserts (BaseClear, Leiden).
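
The assembly of the two primary PCR fragments through their shared end can be mimicked in silico as sketched below. The flanking sequences, the codon choice for SVSVGMKPSPRP and the function `assemble_by_overlap` are hypothetical and serve only to illustrate the overlap principle; they are not the primers or fragments of table 5.

```python
def assemble_by_overlap(fragment1, fragment2, min_overlap=10):
    """Join two sequences through the longest shared end.

    The longest suffix of fragment1 that is also a prefix of fragment2 is
    used as the overlap, as in an overlap extension PCR.
    """
    longest = min(len(fragment1), len(fragment2))
    for size in range(longest, min_overlap - 1, -1):
        if fragment1[-size:] == fragment2[:size]:
            return fragment1 + fragment2[size:]
    raise ValueError("fragments do not share a sufficiently long overlap")


# Hypothetical toy sequences, not taken from the envelope construct.
UPSTREAM = "ATGGCCAAGT"
DOWNSTREAM = "GGATCCTAA"
# One possible coding sequence for SVSVGMKPSPRP (arbitrary codon choice).
INSERT = "TCTGTTTCTGTTGGTATGAAACCTTCTCCTCGTCCT"

fragment1 = UPSTREAM + INSERT    # peptide insertion at the 3' end
fragment2 = INSERT + DOWNSTREAM  # peptide insertion at the 5' end

assembled = assemble_by_overlap(fragment1, fragment2)
print(assembled)
```

In this toy case the shared end is the 36 nucleotide insert itself, so the assembled product carries the insert once between the two flanks.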
[0055] Phages displaying envelope with the various peptide inserts can then be produced and tested for their binding
to and entry of hCAT1 expressing cells and compared to phages displaying the gpIII/env2 construct. The hCAT1 binding
envelopes can then be used to develop retroviral vectors produced by mammalian cell lines.

Example 5. Soluble FAb generation and binding to human cells

[0056] To prepare soluble FAb fragments of the hCAT1 binding FAb phage clone, periplasmic fractions were made
from HB2151 bacteria infected with clone #25 phage. Infected bacteria were grown in LB medium with 2 % glucose and
100 µg/ml ampicillin (LBGA) overnight while shaking at 30 °C. The next day the infected cells were diluted 1:100 in 50
ml fresh LBGA and grown at 37 °C until the OD600 was 0.8. Bacteria were then pelleted followed by resuspension in 25
ml LB, 100 µg/ml ampicillin and 1 mM IPTG and incubation for 3 hours at 30 °C while shaking vigorously. Then the
bacteria were pelleted and resuspended in 1 ml ice-cold PBS followed by a 14-16 hour incubation at 4 °C while rotating.
The next day the periplasmic fraction was cleared from bacterial residues by centrifugation: once for 10 minutes at 8000
rpm (Eppendorf centrifuge #5402) at 4 °C followed by a spin of 10 minutes at 14000 rpm, 4 °C. Then the FAb
periplasmic fractions were aliquoted and stored at -20 °C. The presence and expression of FAb fragments was
confirmed by a dot blot probed for human kappa light chains with anti-human kappa polyclonal rabbit antibodies (Dako
A0191, 1:1000 dilution, 60 minutes) and anti-rabbit IgG (H + L) antibodies conjugated with horseradish peroxidase
(Bio-Rad 170-6515, 1:20,000, 60 minutes). Each step was followed by washing 6 times with PBS, 0.05 % Tween 20
(v/v). Final detection of human FAbs was done using ECL staining (Amersham). This revealed the presence of high
concentrations of soluble FAb fragment of hCAT1 binding clone #25. Dilutions of antibodies were made in PBS, 0.5 %
BSA (w/v), 0.05 % Tween 20 (v/v).

[0057] The FAb fractions made as described above were then used to perform flow cytometric analyses on
911-hCAT1-k08 cells expressing hCAT1 and compared to phages displaying clone 25 (see example 3) (figure 17). For
this purpose cells were incubated with 100 µl periplasmic fraction of clone 25 or control clone for 1 hour at room
temperature followed by washing twice with PBS, 0.1 % BSA and incubation with 500 times diluted anti-human kappa
light chain antibodies (see above) for 30 minutes at room temperature. This was followed by washing twice with
PBS/0.1 % BSA and a 30 minute room temperature incubation with goat anti-rabbit immunoglobulin antibodies
conjugated with phycoerythrin (diluted 1:20 in PBS/0.1 % BSA, Sigma P9795) and measurement in a flow cytometer.
Detection of phage binding was done as described under example 2.

[0058] Clearly FAb preparations of clone 25 bind to hCAT1 expressing cells whereas FAb fragments of an irrelevant
CHO cell binding clone did not. The results are very similar to the results observed with phages displaying FAb clone
25 (figure 17). Compared to phages displaying hCAT1 binding FAb 25, FAb fragments of clone 25 facilitate the
measurement of hCAT1 in a multiparameter setting such as on CD34+ or CD34+lin- cells.
Table 1

                    Stringent selection                            Non-stringent selection
Selection           Input phages   Output phages  Output/input   Input phages   Output phages  Output/input
500 nM peptide      1.4 x 10^10    0.6 x 10^5     4.3 x 10^-6    1.4 x 10^10    1.5 x 10^5     1.1 x 10^-5
500 nM peptide      3.8 x 10^9     1.1 x 10^6     2.9 x 10^-4    3.8 x 10^8     9.8 x 10^5     2.6 x 10^-3
500 nM peptide      7.6 x 10^7     3 x 10^4       3.9 x 10^-4    3.8 x 10^7     9.0 x 10^3     2.4 x 10^-4
500 nM peptide      2.1 x 10^8     6 x 10^5       2.9 x 10^-3    1.1 x 10^8     6.8 x 10^5     6.2 x 10^-3
100 nM peptide      5.3 x 10^10    2.4 x 10^9     4.5 x 10^-2    6.7 x 10^10    1.5 x 10^9     2.2 x 10^-2
100 nM peptide      1.2 x 10^11    1.8 x 10^10    1.5 x 10^-1    1.0 x 10^11    5.0 x 10^10    5.0 x 10^-1
hCAT1 cells (k08)   9.8 x 10^11    1.1 x 10^6     1.1 x 10^-6    1.1 x 10^12    7.2 x 10^8     6.5 x 10^-4
hCAT1 cells (k08)   ND             2 x 10^4
hCAT1 cells (k08)   2.6 x 10^10    2.1 x 10^5     8.1 x 10^-6

Table 2

Round               Target          Insert sequence   No. of identical clones
Amplified library   None            EQSRPSWQLTPT      1
                                    QTHQLLRKPPSF      1
                                    YMHEPITPNPVT      1
                                    WHHIPNSAKISL      1
                                    SENLTLMTVLQM      1
                                    NLMPPPVPRLPL      1
                                    TPQGVHYHPNMR      1
1                   hCAT1 peptide   ND
2                   hCAT1 peptide   ND
3                   hCAT1 peptide   TLNNHTTPPAWN      1
                                    QVVHSPFPTSRP      1
4                   hCAT1 peptide   ND
5                   hCAT1 peptide   FEQHNWWDSHPQ      1
                                    NTFDLWLQSVPQ      7
6                   hCAT1 peptide   FEGCHPQSGLSC      1
                                    FEQHNWWDSHPQ      1
                                    NTFDLWLQSVPQ      5
                                    SVSVGMKPSPRP      4
1                   hCAT1 cells     SVSVGMKPSPRP      4
2                   hCAT1 cells     SVSVGMKPSPRP      4
3                   hCAT1 cells     SVSVGMKPSPRP      23
Table 3

Binding and internalisation of phages displaying peptide SVSVGMKPSPRP

                                                #pfu x 1000
Cell line     Temp.   Phage rescue         clone #26   12-mer library
911-hCAT1     37 °C   Elution              120         4.32
                      Lysis                72          18.72
911-hCAT1     37 °C   Whole sample lysis   205.2       88.2
911-pcDNA3    37 °C   Elution              55.68       3.84
                      Lysis                47.52       8.64
911-pcDNA3    37 °C   Whole sample lysis   216         0

Table 4

Selection           Input phages   Output phages  Output/input ratio
500 nM peptide      2.7 x 10^12    9.0 x 10^5     3.6 x 10^-7
500 nM peptide      5.7 x 10^12    2.0 x 10^6     3.3 x 10^-7
100 nM peptide      9.5 x 10^12    1.5 x 10^10    1.6 x 10^-3
20 nM peptide       7.0 x 10^12    3.7 x 10^10    5.2 x 10^-3
hCAT1 cells (k08)   7.0 x 10^12    3.0 x 10^6     4.4 x 10^-7
hCAT1 cells (k08)   5.4 x 10^12    1.7 x 10^7     3.1 x 10^-6
hCAT1 cells (k08)   5.4 x 10^12    1.5 x 10^7     2.8 x 10^-6
SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Introgene B.V. 

(B) STREET: Wassenaarseweg 72 

(C) CITY: Leiden 

(D) STATE: Zuid-Holland 

(E) COUNTRY : the Netherlands 

(F) POSTAL CODE (ZIP): 2333 AL 

(ii) TITLE OF INVENTION: Targeted delivery through a cationic amino acid transporter

(iii) NUMBER OF SEQUENCES: 59

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

(v) CURRENT APPLICATION DATA:

APPLICATION NUMBER: EP 98201693.3 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Lys Asn Trp Gln Leu Thr Glu Glu Asp Phe Gly Asn Thr Ser Gly Arg
1 5 10 15 

Leu Cys Leu Asn Asn Asp Thr Lys Glu Gly Lys Pro Gly Val Gly Gly 
20 25 30 

Phe 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

CCCTCATAGT TAGCGTAACG 
20 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Ser Val Ser Val Gly Met Lys Pro Ser Pro Arg Pro 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Glu Gln Ser Arg Pro Ser Trp Gln Leu Thr Pro Thr
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: unknown

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Gln Thr His Gln Leu Leu Arg Lys Pro Pro Ser Phe
1 5 10

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Tyr Met His Glu Pro Ile Thr Pro Asn Pro Val Thr
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Trp His His Ile Pro Asn Ser Ala Lys Ile Ser Leu
1 5 10

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Ser Glu Asn Leu Thr Leu Met Thr Val Leu Gln Met
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Asn Leu Met Pro Pro Pro Val Pro Arg Leu Pro Leu 
1 5 10

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 
(B) TYPE: amino acid
(C) STRANDEDNESS: unknown
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Thr Pro Gln Gly Val His Tyr His Pro Asx Asn Met Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Thr Leu Asn Asn His Thr Thr Pro Pro Ala Trp Asn 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Gln Val Val His Ser Pro Phe Pro Thr Ser Arg Pro 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Phe Glu Gln His Asn Trp Trp Asp Ser His Pro Gln 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Asn Thr Phe Asp Leu Trp Leu Gln Ser Val Pro Gln 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



Phe Glu Gly Cys His Pro Gln Ser Gly Leu Ser Cys 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

TTTGAGCAGC ATAATTGGTG GGATTCGCAT CCTCAGCCCC CGGGGCCCCC TTGT 
54 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CTGAGGATGC GAATCCCACC AATTATGCTG CTCAAAGGAT TGATATTCTA GCCC 
54 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

TTTGAGCAGC ATAATTGGTG GGATTCGCAT CCTCAGCGGT GCAACACTGC CTGG 
54 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CTGAGGATGC GAATCCCACC AATTATGCTG CTCAAAAGGT TCTTCGCAGT CTCT 
54 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

AATACTTTTG ATCTTTGGCT GCAGTCTGTT CCTCAGCCCC CGGGGCCCCC TTGT 
54 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CTGAGGAACA GACTGCAGCC AAAGATCAAA AGTATTGGAT TGATATTCTA GCCC 
54 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

AATACTTTTG ATCTTTGGCT GCAGTCTGTT CCTCAGCGGT GCAACACTGC CTGG 
54 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

CTGAGGAACA GACTGCAGCC AAAGATCAAA AGTATTAGGT TCTTCGCAGT CTCT 
54 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 54 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

TCTGTTTCTG TGGGTATGAA GCCGAGTCCT AGGCCTCCCC CGGGGCCCCC TTGT 
54 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

AGGCCTAGGA CTCGGCTTCA TACCCACAGA AACAGAGGAT TGATATTCTA GCCC 
54 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

TCTGTTTCTG TGGGTATGAA GCCGAGTCCT AGGCCTCGGT GCAACACTGC CTGG 
54 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

AGGCCTAGGA CTCGGCTTCA TACCCACAGA AACAGAAGGT TCTTCGCAGT CTCT 
54 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

TTTGAGGGGT GTCATCCTCA GTCGGGGCTG TCTTGTCCCC CGGGGCCCCC TTGT 
54 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

ACAAGACAGC CCCGACTGAG GATGACACCC CTCAAAGGAT TGATATTCTA GCCC 
54 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

TTTGAGGGGT GTCATCCTCA GTCGGGGCTG TCTTGTCGGT GCAACACTGC CTGG 
54 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 54 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

ACAAGACAGC CCCGACTGAG GATGACACCC CTCAAAAGGT TCTTCGCAGT CTCT 
54 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

ATCACCTGGG AGGTAACCGG CCATATGTTT GAGCAGCATA ATTGGTGGGA TTCGCATCCT 
60 

CAGGGTGCTA GCTTGGTAAC CAATGGAGAT CG 
92 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CGATCTCCAT TGGTTACCAA GCTAGCACCC TGAGGATGCG AATCCCACCA ATTATGCTGC 
60 

TCAAACATAT GGCCGGTTAC CTCCCAGGTG AT 
92 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

ATCACCTGGG AGGTAACCGG CCATATGAAT ACTTTTGATC TTTGGCTGCA GTCTGTTCCT 
60 

CAGGGTGCTA GCTTGGTAAC CAATGGAGAT CG 
92 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

CGATCTCCAT TGGTTACCAA GCTAGCACCC TGAGGAACAG ACTGCAGCCA AAGATCAAAA 
60 

GTATTCATAT GGCCGGTTAC CTCCCAGGTG AT 
92 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

ATCACCTGGG AGGTAACCGG CCATATGTCT GTTTCTGTGG GTATGAAGCC GAGTCCTAGG 
60 

CCTGGTGCTA GCTTGGTAAC CAATGGAGAT CG 
92 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

CGATCTCCAT TGGTTACCAA GCTAGCACCA GGCCTAGGAC TCGGCTTCAT ACCCACAGAA 
60 

ACAGACATAT GGCCGGTTAC CTCCCAGGTG AT 
92 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

ATCACCTGGG AGGTAACCGG CCATATGTTT GAGGGGTGTC ATCCTCAGTC GGGGCTGTCT 
60 

TGTGGTGCTA GCTTGGTAAC CAATGGAGAT CG 
92 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

CGATCTCCAT TGGTTACCAA GCTAGCACCA CAAGACAGCC CCGACTGAGG ATGACACCCC 
60 

TCAAACATAT GGCCGGTTAC CTCCCAGGTG AT 
92 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Gly His Met Phe Glu Gln His Asn Trp Trp Asp Ser His Pro Gln Gly 
1 5 10 15 

Ala Ser Leu Val Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Gly His Met Asn Thr Phe Asp Leu Trp Leu Gln Ser Val Pro Gln Gly 
1 5 10 15 

Ala Ser Leu Val Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Gly His Met Ser Val Ser Val Gly Met Lys Pro Ser Pro Arg Pro Gly 
1 5 10 15 

Ala Ser Leu Val Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Gly His Met Phe Glu Gly Cys His Pro Gln Ser Gly Leu Ser Cys Gly 
1 5 10 15 

Ala Ser Leu Val Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

TGAGCAGCAT AATTGGTGGG 
20 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

TTGATCTTTG GCTGCAGTCT 
20 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

TCTGTGGGTA TGAAGCCGAG 
20 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: unknown 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

TTTGAGGGGT GTCATCCTCA 
20 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

GTCCTAGATT TTGGTATCTG 
20 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

CTCGCTCGCC CATATGCGGC CGCAGGTCTC CTCCTCTTAG CAGCACAACC AGCAATGGCC 
60 

GCTTCGCCCG GCTCC 
75 

(2) INFORMATION FOR SEQ ID NO: 50: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

AGCATCACTA GTCGCCGGTG GAAGTTG 
27 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

AAAAACTGGC AGCTCACGGA GGAGGATTTT GGGAACACAT CAGGCCGTCT C 
51 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Lys Asn Trp Gln Leu Thr Glu Glu Asp Phe Gly Asn Thr Ser Gly Arg 
1 5 10 15 

Leu 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

TGTTTGAACA ATGACACAAA AGAAGGGAAG CCCGGTGTTG GTGGATTC 
48 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Cys Leu Asn Asn Asp Thr Lys Glu Gly Lys Pro Gly Val Gly Gly Phe 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Lys Arg Arg Asn Asn Asp Thr Lys Glu Gly Lys Pro Gly Val Gly Gly 
1 5 10 15 

Phe Met Pro Phe Gly Phe Ser Gly Val Leu Ser 
20 25 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Lys Arg Arg Asn Asn Asp Thr Asn Val Lys Tyr Gly Glu Gly Gly Phe 
1 5 10 15 

Met Pro Phe Gly Phe Ser Gly Val Leu Ser 
20 25 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5925 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

GACGAAAGGG CCTCGTGATA CGCCTATTTT TATAGGTTAA TGTCATGATA ATAATGGTTT 
60 

CTTAGACGTC AGGTGGCACT TTTCGGGGAA ATGTGCGCGG AACCCCTATT TGTTTATTTT 
120 

TCTAAATACA TTCAAATATG TATCCGCTCA TGAGACAATA ACCCTGATAA ATGCTTCAAT 
180 

AATATTGAAA AAGGAAGAGT ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT 
240 

TTGCGGCATT TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 
300 

CTGAAGATCA GTTGGGTGCC CGAGTGGGTT ACATCGAACT GGATCTCAAC AGCGGTAAGA 
360 

TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT GAGCACTTTT AAAGTTCTGC 
420 

TATGTGGCGC GGTATTATCC CGTATTGACG CCGGGCAAGA GCAACTCGGT CGCCGCATAC 
480 

ACTATTCTCA GAATGACTTG GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG 
540 

GCATGACAGT AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 
600 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG CACAACATGG 
660 

GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT GAATGAAGCC ATACCAAACG 
720 

ACGAGCGTGA CACCACGATG CCTGTAGCAA TGGCAACAAC GTTGCGCAAA CTATTAACTG 
780 

GCGAACTACT TACTCTAGCT TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG 
840 

TTGCAGGACC ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 
900 

GAGCCGGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT GGTAAGCCCT 
960 

CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC TATGGATGAA CGAAATAGAC 
1020 

AGATCGCTGA GATAGGTGCC TCACTGATTA AGCATTGGTA ACTGTCAGAC CAAGTTTACT 
1080 

CATATATACT TTAGATTGAT TTAAAACTTC ATTTTTAATT TAAAAGGATC TAGGTGAAGA 
1140 

TCCTTTTTGA TAATCTCATG ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT 
1200 

CAGACCCCGT AGAAAAGATC AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT 
1260 

GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC 
1320 

TACCAACTCT TTTTCCGAAG GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTCC 
1380 

TTCTAGTGTA GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC 
1440 

TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG 
1500 

GGTTGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT 
1560 

CGTGCATACA GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG 
1620 

AGCATTGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG 
1680 

GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT 
1740 

ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG 
1800 

GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT 
1860 

GCTGGCCTTT TGCTCACATG TTCTTTCCTG CGTTATCCCC TGATTCTGTG GATAACCGTA 
1920 

TTACCGCCTT TGAGTGAGCT GATACCGCTC GCCGCAGCCG AACGACCGAG CGCAGCGAGT 
1980 

CAGTGAGCGA GGAAGCGGAA GAGCGCCCAA TACGCAAACC GCCTCTCCCC GCGCGTTGGC 
2040 

CGATTCATTA ATGCAGCTGG CACGACAGGT TTCCCGACTG GAAAGCGGGC AGTGAGCGCA 
2100 

ACGCAATTAA TGTGAGTTAG CTCACTCATT AGGCACCCCA GGCTTTACAC TTTATGCTTC 
2160 

CGGCTCGTAT GTTGTGTGGA ATTGTGAGCG GATAACAATT TCACACAGGA AACAGCTATG 
2220 

ACCATGATTA CGCCAAGCTT TGGAGCCTTT TTTTTGGAGA TTTTCAACGT GAAAAAATTA 
2280 

TTATTCGCAA TTCCTTTAGT TGTTCCTTTC TATTCTCACA GTGCACTTGA AACGACACTC 
2340 

ACGCAGTCTC CAGGCATCCT GTCTTTGTCT CCGGGGGCAG GAGCCACCCT CTCCTGCAGG 
2400 

GCCAGTCAGA GTGTCAGCAG CAGGAACTTA GCCTGGTACC AGCAGAAACC TGGCCAGGCT 
2460 

CCCAGGCTCC TCATCTATGG TGTATCCAAC AGGGCCACTG GCGTCCCAGA CAGGTTCAGT 
2520 

GGCAGTGGGT CTGGGGCAGA CTTCACTCTC ACCATCAACA GACTGGAGCC TGAAGATTTT 
2580 

GCGGTGTATT ACTGTCAGCG GTATGGCAGG TCACTGTGGA CGTTCGGTCA AGGGACCAAG 
2640 

GTGGAGATCA AACGTGGAAC TGTGGCTGCA CCATCTGTCT TCATCTTCCC GCCATCTGAT 
2700 

GAGCAGTTGA AATCTGGAAC TGCCTCTGTT GTGTGCCTGC TGAATAACTT CTATCCCAGA 
2760 

GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT CGGGTAACTC CCAGGAGAGT 
2820 

GTCACAGAGC AGGACAGCAA GGACAGCACC TACAGCCTCA GCAGCACCCT GACGCTGAGC 
2880 

AAAGCAGACT ACGAGAAACA CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGT 
2940 

TCACCGGTGA CAAAGAGCTT CAACAGGGGA GAGTGTTAAT AAGGCGCGCC AATTCTATTT 
3000 

CAAGGAGACA GTCATAATGA AATACCTATT GCCTACGGCA GCCGCTGGAT TGTTATTACT 
3060 

CGCGGCCCAG CCGGCCATGG CCCAGGTCCA GCTGGTGCAG TCTGGGGGAG GCGTGGTCCA 
3120 

GCCTGGGAGG TCCCTGAGAC TCTCCTGTGC AGCCTCTGGA TTCACCTTCA GTAGCTATGC 
3180 

TATGCACTGG GTCCGCCAGG CTCCAGGCAA GGGGCTGGAG TGGGTGGCAG TTATATCATA 
3240 

TGATGGAAGC AATAAATACT ACGCAGACTC CGTGAAGGGC CGATTCACCA TCTCCAGAGA 
3300 

CAATTCCAAG AACACGCTGT ATCTGCAAAT GAACAGCCTG AGAGCTGAGG ACACGGCTGT 
3360 

GTATTACTGT GCGAGAGGGA TTACAGTAAC TAAATCACGA TTTGACTACT GGGGCCAGGG 
3420 

CACCCTGGTC ACCGTCTCAA GCGCCTCCAC CAAGGGCCCA TCGGTCTTCC CCCTGGCACC 
3480 

CTCCTCCAAG AGCACCTCTG GGGGCACAGC GGCCCTGGGC TGCCTGGTCA AGGACTACTT 
3540 

CCCCGAACCG GTGACGGTGT CGTGGAACTC AGGCGCCCTG ACCAGCGGCG TCCACACCTT 
3600 

CCCGGCTGTC CTACAGTCCT CAGGACTCTA CTCCCTCAGC AGCGTAGTGA CCGTGCCCTC 
3660 

CAGCAGCTTG GGCACCCAGA CCTACATCTG CAACGTGAAT CACAAGCCCA GCAACACCAA 
3720 

GGTGGACAAG AAAGTTGAGC CCAAATCTTG TGCGGCCGCA CATCATCATC ACCATCACGG 
3780 

GGCCGCAGAA CAAAAACTCA TCTCAGAAGA GGATCTGAAT GGGGCCGCAT AGACTGTTGA 
3840 

AAGTTGTTTA GCAAAACCTC ATACAGAAAA TTCATTTACT AACGTCTGGA AAGACGACAA 
3900 

AACTTTAGAT CGTTACGCTA ACTATGAGGG CTGTCTGTGG AATGCTACAG GCGTTGTGGT 
3960 

TTGTACTGGT GACGAAACTC AGTGTTACGG TACATGGGTT CCTATTGGGC TTGCTATCCC 
4020 

TGAAAATGAG GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG GGTGGCGGTT CTGAGGGTGG 
4080 

CGGTACTAAA CCTCCTGAGT ACGGTGATAC ACCTATTCCG GGCTATACTT ATATCAACCC 
4140 

TCTCGACGGC ACTTATCCGC CTGGTACTGA GCAAAACCCC GCTAATCCTA ATCCTTCTCT 
4200 

TGAGGAGTCT CAGCCTCTTA ATACTTTCAT GTTTCAGAAT AATAGGTTCC GAAATAGGCA 
4260 

GGGTGCATTA ACTGTTTATA CGGGCACTGT TACTCAAGGC ACTGACCCCG TTAAAACTTA 
4320 

TTACCAGTAC ACTCCTGTAT CATCAAAAGC CATGTATGAC GCTTACTGGA ACGGTAAATT 
4380 

CAGAGACTGC GCTTTCCATT CTGGCTTTAA TGAGGATCCA TTCGTTTGTG AATATCAAGG 
4440 

CCAATCGTCT GACCTGCCTC AACCTCCTGT CAATGCTGGC GGCGGCTCTG GTGGTGGTTC 
4500 

TGGTGGCGGC TCTGAGGGTG GCGGCTCTGA GGGTGGCGGT TCTGAGGGTG GCGGCTCTGA 
4560 

GGGTGGCGGT TCCGGTGGCG GCTCCGGTTC CGGTGATTTT GATTATGAAA AAATGGCAAA 
4620 

CGCTAATAAG GGGGCTATGA CCGAAAATGC CGATGAAAAC GCGCTACAGT CTGACGCTAA 
4680 

AGGCAAACTT GATTCTGTCG CTACTGATTA CGGTGCTGCT ATCGATGGTT TCATTGGTGA 
4740 

CGTTTCCGGC CTTGCTAATG GTAATGGTGC TACTGGTGAT TTTGCTGGCT CTAATTCCCA 
4800 

AATGGCTCAA GTCGGTGACG GTGATAATTC ACCTTTAATG AATAATTTCC GTCAATATTT 
4860 

ACCTTCTTTG CCTCAGTCGG TTGAATGTCG CCCTTATGTC TTTGGCGCTG GTAAACCATA 
4920 

TGAATTTTCT ATTGATTGTG ACAAAATAAA CTTATTCCGT GGTGTCTTTG CGTTTCTTTT 
4980 

ATATGTTGCC ACCTTTATGT ATGTATTTTC GACGTTTGCT AACATACTGC GTAATAAGGA 
5040 

GTCTTAATAA GAATTCACTG GCCGTCGTTT TACAACGTCG TGACTGGGAA AACCCTGGCG 
5100 

TTACCCAACT TAATCGCCTT GCAGCACATC CCCCTTTCGC CAGCTGGCGT AATAGCGAAG 
5160 

AGGCCCGCAC CGATCGCCCT TCCCAACAGT TGCGCAGCCT GAATGGCGAA TGGCGCCTGA 
5220 

TGCGGTATTT TCTCCTTACG CATCTGTGCG GTATTTCACA CCGCATATAA ATTGTAAACG 
5280 

TTAATATTTT GTTAAAATTC GCGTTAAATT TTTGTTAAAT CAGCTCATTT TTTAACCAAT 
5340 

AGGCCGAAAT CGGCAAAATC CCTTATAAAT CAAAAGAATA GCCCGAGATA GGGTTGAGTG 
5400 

TTGTTCCAGT TTGGAACAAG AGTCCACTAT TAAAGAACGT GGACTCCAAC GTCAAAGGGC 
5460 

GAAAAACCGT CTATCAGGGC GATGGCCCAC TACGTGAACC ATCACCCAAA TCAAGTTTTT 
5520 

TGGGGTCGAG GTGCCGTAAA GCACTAAATC GGAACCCTAA AGGGAGCCCC CGATTTAGAG 
5580 

CTTGACGGGG AAAGCCGGCG AACGTGGCGA GAAAGGAAGG GAAGAAAGCG AAAGGAGCGG 
5640 

GCGCTAGGGC GCTGGCAAGT GTAGCGGTCA CGCTGCGCGT AACCACCACA CCCGCCGCGC 
5700 

TTAATGCGCC GCTACAGGGC GCGTACTATG GTTGCTTTGA CGGGTGCAGT CTCAGTACAA 
5760 

TCTGCTCTGA TGCCGCATAG TTAAGCCAGC CCCGACACCC GCCAACACCC GCTGACGCGC 
5820 

CCTGACGGGC TTGTCTGCTC CCGGCATCCG CTTACAGACA AGCTGTGACC GTCTCCGGGA 
5880 

GCTGCATGTG TCAGAGGTTT TCACCGTCAT CACCGAAACG CGCGA 
5925 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Val Lys Lys Leu Leu Phe Ala Ile Pro Leu Val Val Pro Phe Tyr Ser 
1 5 10 15 

His Ser Ala Leu Glu Thr Thr Leu Thr Gln Ser Pro Gly Ile Leu Ser 
20 25 30 

Leu Ser Pro Gly Ala Gly Ala Thr Leu Ser Cys Arg Ala Ser Gln Ser 
35 40 45 

Val Ser Ser Arg Asn Leu Ala Trp Tyr Gln Gln Lys Pro Gly Gln Ala 
50 55 60 

Pro Arg Leu Leu Ile Tyr Gly Val Ser Asn Arg Ala Thr Gly Val Pro 
65 70 75 80 

Asp Arg Phe Ser Gly Ser Gly Ser Gly Ala Asp Phe Thr Leu Thr Ile 
85 90 95 

Asn Arg Leu Glu Pro Glu Asp Phe Ala Val Tyr Tyr Cys Gln Arg Tyr 
100 105 110 

Gly Arg Ser Leu Trp Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys 
115 120 125 

Arg Gly Thr Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp 
130 135 140 

Glu Gln Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn 
145 150 155 160 

Phe Tyr Pro Arg Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu 
165 170 175 

Gln Ser Gly Asn Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp 
180 185 190 

Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr 
195 200 205 

Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser 
210 215 220 

Ser Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys 
225 230 235 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 254 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
1 5 10 15 

Ala Gln Pro Ala Met Ala Gln Val Gln Leu Val Gln Ser Gly Gly Gly 
20 25 30 

Val Val Gln Pro Gly Arg Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly 
35 40 45 

Phe Thr Phe Ser Ser Tyr Ala Met His Trp Val Arg Gln Ala Pro Gly 
50 55 60 

Lys Gly Leu Glu Trp Val Ala Val Ile Ser Tyr Asp Gly Ser Asn Lys 
65 70 75 80 

Tyr Tyr Ala Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn 
85 90 95 

Ser Lys Asn Thr Leu Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp 
100 105 110 

Thr Ala Val Tyr Tyr Cys Ala Arg Gly Ile Thr Val Thr Lys Ser Arg 
115 120 125 

Phe Asp Tyr Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser 
130 135 140 

Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr 
145 150 155 160 

Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro 
165 170 175 

Glu Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val 
180 185 190 

His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser 
195 200 205 

Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile 
210 215 220 

Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys Val 
225 230 235 240 

Glu Pro Lys Ser Cys Ala Ala Ala His His His His His His 
245 250 
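
Two relationships among the listed sequences can be checked mechanically. The following is a minimal sketch, assuming the standard genetic code: it tests whether SEQ ID NO: 33 is the reverse complement of SEQ ID NO: 32, and whether the 21 codons of SEQ ID NO: 32 starting at nucleotide 19 translate to the peptide of SEQ ID NO: 40. The helper names `reverse_complement` and `translate`, and the choice of reading-frame offset, are assumptions made for this illustration and do not appear in the listing.

```python
# Illustrative consistency check; helper names are hypothetical.

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq: str, start: int, n_codons: int) -> str:
    """Translate n_codons codons of seq, beginning at 0-based index start."""
    end = start + 3 * n_codons
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(start, end, 3))


# Sequences copied from the listing, with the block spacing removed.
SEQ_ID_32 = (
    "ATCACCTGGG AGGTAACCGG CCATATGTTT GAGCAGCATA ATTGGTGGGA TTCGCATCCT "
    "CAGGGTGCTA GCTTGGTAAC CAATGGAGAT CG"
).replace(" ", "")
SEQ_ID_33 = (
    "CGATCTCCAT TGGTTACCAA GCTAGCACCC TGAGGATGCG AATCCCACCA ATTATGCTGC "
    "TCAAACATAT GGCCGGTTAC CTCCCAGGTG AT"
).replace(" ", "")
# SEQ ID NO: 40 in one-letter code (Gly His Met Phe Glu Gln ... Val Thr).
SEQ_ID_40 = "GHMFEQHNWWDSHPQGASLVT"

print("length of SEQ ID NO: 32:", len(SEQ_ID_32))
print("33 is reverse complement of 32:",
      reverse_complement(SEQ_ID_32) == SEQ_ID_33)
print("32 encodes 40 from nucleotide 19:",
      translate(SEQ_ID_32, 18, 21) == SEQ_ID_40)
```

The same two helpers could be applied to the other 92-base-pair pairs (SEQ ID NOs: 34 to 39) and the peptides of SEQ ID NOs: 41 to 43; those checks are not shown here.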



Claims 

1. A virus-like particle or gene delivery vehicle provided with a ligand capable of binding to a human amino acid transporter. 

2. A virus-like particle or gene delivery vehicle according to claim 1 for delivery of genes to human cells. 

3. A virus-like particle or gene delivery vehicle according to claim 1 or 2 comprising at least one viral protein provided 
with said ligand. 

4. A virus-like particle or gene delivery vehicle according to claim 3 wherein said viral protein comprises an envelope 
protein. 

5. A virus-like particle or gene delivery vehicle according to claim 4 derived from a retrovirus. 

6. A virus-like particle or gene delivery vehicle according to claim 3 wherein said viral protein comprises a capsid protein. 

7. A virus-like particle or gene delivery vehicle according to claim 6 derived from an adenovirus. 

8. A virus-like particle or gene delivery vehicle according to any one of claims 1 to 7 wherein said amino acid transporter is a cationic amino acid transporter. 

9. A virus-like particle or gene delivery vehicle according to claim 8 wherein said transporter is a human cationic 
amino acid transporter-1 (hCAT1). 

10. A virus-like particle or gene delivery vehicle according to any one of claims 1 to 9 wherein said ligand comprises 
an amino acid sequence selected from Table 2, preferably from the last four different sequences of Table 2, or a 
sequence functionally related thereto. 

11. A virus-like particle or gene delivery vehicle according to claim 10 wherein said ligand comprises at least a part of 
the amino acid sequence SVSVGMKPSPRP. 

12. A virus-like particle or gene delivery vehicle according to any one of claims 1 to 9 wherein said ligand comprises a 
fragment derived from a phage displaying at least one antibody fragment selected for its capacity to bind with said 
amino acid transporter. 

13. A virus-like particle or gene delivery vehicle according to claim 12 wherein said antibody fragment comprises an 
amino acid sequence as shown in Figure 16 or an amino acid sequence functionally equivalent thereto. 